智能论文笔记

Energy-Sensitive Trajectory Design and Restoration Areas Allocation for UAV-Enabled Grassland Restoration

Dongbin Jiao , Lingyu Wang , Peng Yang , Weibo Yang , Yu Peng , Zhanhuan Shang , Fengyuan Ren

分类：神经与进化计算

2022-04-10

草原修复是保护草原生态退化的关键手段。为了减轻广泛的人类劳动并提高了恢复效率，无人机的全自动能力很有希望，但仍在等待被利用。本文通过在计划草地修复时明确考虑了无人机和草地退化的现实限制来推动这项新兴技术。为此，在有限的无人机电池能量，草种子的重量，恢复区域的数量以及相应的尺寸下，在数学上以数学建模为数学建模。然后，我们分析了这些原始问题通过考虑这些限制，即最短的飞行路径和最佳区域分配出现了两个冲突目标。结果，恢复区域的最大化是轨迹设计问题和高度耦合区域分配问题的综合。从优化的角度来看，这需要解决旅行推销员问题（TSP）和多维背包问题（MKP）的两个NP硬问题。为了解决这个复杂的问题，我们提出了一种称为Chapbilm的合作优化算法，以通过利用它们之间的相互依赖性来交入解决这两个问题。多个模拟验证轨迹设计与区域分配之间的冲突。合作优化算法的有效性也得到了与传统优化方法的比较，这些方法不利用两个问题之间的相互依赖性。结果，提出的算法以近乎理想的方式成功地解决了多个仿真实例。

translated by 谷歌翻译

CountingMOT: Joint Counting, Detection and Re-Identification for Multiple Object Tracking

Weihong Ren , Bowen Chen , Yuhang Shi , Weibo Jiang , Honghai Liu

分类：计算机视觉

2022-12-12

The recent trend in multiple object tracking (MOT) is jointly solving detection and tracking, where object detection and appearance feature (or motion) are learned simultaneously. Despite competitive performance, in crowded scenes, joint detection and tracking usually fail to find accurate object associations due to missed or false detections. In this paper, we jointly model counting, detection and re-identification in an end-to-end framework, named CountingMOT, tailored for crowded scenes. By imposing mutual object-count constraints between detection and counting, the CountingMOT tries to find a balance between object detection and crowd density map estimation, which can help it to recover missed detections or reject false detections. Our approach is an attempt to bridge the gap of object detection, counting, and re-Identification. This is in contrast to prior MOT methods that either ignore the crowd density and thus are prone to failure in crowded scenes, or depend on local correlations to build a graphical relationship for matching targets. The proposed MOT tracker can perform online and real-time tracking, and achieves the state-of-the-art results on public benchmarks MOT16 (MOTA of 77.6), MOT17 (MOTA of 78.0%) and MOT20 (MOTA of 70.2%).

translated by 谷歌翻译

Adaptive Perception Transformer for Temporal Action Localization

Yizheng Ouyang , Tianjin Zhang , Weibo Gu , Hongfa Wang , Liming Wang , Xiaojie Guo

分类：计算机视觉

2022-08-25

时间动作本地化旨在预测未修剪长视频中每个动作实例的边界和类别。基于锚或建议的大多数先前方法忽略了整个视频序列中的全局本地上下文相互作用。此外，他们的多阶段设计无法直接生成动作边界和类别。为了解决上述问题，本文提出了一种新颖的端到端模型，称为自适应感知变压器（简称apperformer）。具体而言，Adaperformer探索了双支球多头的自我发项机制。一个分支会照顾全球感知的关注，该注意力可以模拟整个视频序列并汇总全球相关环境。而其他分支集中于局部卷积转移，以通过我们的双向移动操作来汇总框架内和框架间信息。端到端性质在没有额外步骤的情况下产生视频动作的边界和类别。提供了广泛的实验以及消融研究，以揭示我们设计的有效性。我们的方法在Thumos14数据集上实现了最先进的准确性（根据map@0.5、42.6 \％map@0.7和62.7 \％map@avg），并在活动网络上获得竞争性能， -1.3数据集，平均地图为36.1 \％。代码和型号可在https://github.com/soupero/adaperformer上找到。

translated by 谷歌翻译

Boosting Multi-Modal E-commerce Attribute Value Extraction via Unified Learning Scheme and Dynamic Range Minimization

Mengyin Liu , Chao Zhu , Hongyu Gao , Weibo Gu , Hongfa Wang , Wei Liu , Xu-cheng Yin

分类：计算机视觉 | 自然语言处理

2022-07-15

随着电子商务行业的繁荣，将各种方式（例如愿景和语言）用于描述产品项目。了解这种多样化的数据是一个巨大的挑战，尤其是通过有用的图像区域提取文本序列中的属性值对。尽管以前的一系列作品已致力于这项任务，但很少有人研究障碍，阻碍了进一步的进一步改进：1）上流单模式预处理的参数不足，而无需在下游多人中进行适当的合理微调。 - 模式任务。 2）要选择图像的描述性部分，不管先验的信息应通过更强的编码器将与语言相关的信息编码为常见的语言嵌入空间，从而广泛应用了简单的晚期融合。 3）由于产品之间的多样性，它们的属性集往往差异很大，但是当前的方法以不必要的最大范围预测，并带来更多潜在的假阳性。为了解决这些问题，我们在本文中提出了一种新颖的方法，可以通过统一学习方案和动态范围最小化提高多模式电子商务属性的价值提取：1）首先，统一方案旨在共同培训多模式任务带有预审计的单模式参数。 2）其次，提出了一种文本引导的信息范围最小化方法，以将每种模态的描述性部分自适应地编码为具有强大审慎的语言模型的相同空间。 3）此外，提出了一种原型引导的属性范围最小化方法，以首先确定当前产品的适当属性集，然后选择原型以指导所选属性的预测。关于流行的多模式电子商务基准的实验表明，我们的方法比其他最新技术的方法更出色。

translated by 谷歌翻译

Novel total hip surgery robotic system based on self-localization and optical measurement

Weibo Ning , Jiaqi Zhu , Hongjiang Chen , Weijun Zhou , Shuxing He , Yecheng Tan , Qianrui Xu , Ye Yuan , Jun Hu , Zhun Fan

分类：机器人

2022-06-15

本文介绍了全髋关节置换术（THA）的手术机器人系统的开发和实验评估。尽管在关节置换手术中使用的现有机器人系统已经取得了一些进展，但机器人组必须在操作过程中准确地位于目标位置，这在很大程度上取决于外科医生的经验。此外，手持式髋臼铰刀通常表现出不均匀的强度和研磨文件。此外，缺乏实时测量股骨颈长度的技术可能导致结果不佳。为了应对这些挑战，我们提出了一种可实时可追溯的光学定位策略，以减少手术期间对机器人臂的不必要的手动调整，一种稳定磨削的最终效用系统，以及提供股骨颈的实时测量的光学探测器长度和其他参数用于选择适当的假体。下肢的长度在安装假体时测量。实验评估结果表明，根据其准确性，执行能力和鲁棒性，提出的手术机器人系统对于THA是可行的。

translated by 谷歌翻译

Characterizing Multi-Domain False News and Underlying User Effects on Chinese Weibo

Qiang Sheng , Juan Cao , H. Russell Bernard , Kai Shu , Jintao Li , Huan Liu

分类：自然语言处理

2022-05-06

在过去几年中，社交媒体上传播的错误消息激增，并导致了现实世界中的多种威胁。尽管有关于特定领域的虚假新闻（例如政治或医疗保健）的研究，但比较跨领域的虚假新闻几乎没有工作。在本文中，我们调查了2009年至2019年中国最大的Twitter式社交媒体平台的微博上的九个领域的虚假新闻。新收集的数据包含44,728个帖子，由40,215个用户发布，并重新发布了。 340万次。基于多域数据集的分布和传播，我们观察到，在诸如健康和医学之类的日常生活的领域中，虚假的消息比政治等其他领域的帖子更有效，但有效地传播的帖子较少，而政治虚假新闻具有最有效的扩散能力。关于微博上广泛散布的虚假新闻帖子与某些类型的用户（按性别，年龄等。此外，这些帖子都引起了重新播放的强烈情绪，并随着False-News启动器的积极参与而进一步扩散。我们的发现有可能在可疑新闻发现，真实性预测以及显示和解释中帮助设计错误的新闻检测系统。微博上的发现与现有作品的发现表明了细微的模式，这表明需要对来自不同平台，国家或语言的数据进行更多研究，以解决全球错误新闻。代码和新的匿名数据集可在https://github.com/ictmcg/characterizing-weibo-multi-domain-false-news上找到。

translated by 谷歌翻译

Backdoor Attacks Against Dataset Distillation

Yugeng Liu , Zheng Li , Michael Backes , Yun Shen , Yang Zhang

分类：机器学习

2023-01-03

Dataset distillation has emerged as a prominent technique to improve data efficiency when training machine learning models. It encapsulates the knowledge from a large dataset into a smaller synthetic dataset. A model trained on this smaller distilled dataset can attain comparable performance to a model trained on the original training dataset. However, the existing dataset distillation techniques mainly aim at achieving the best trade-off between resource usage efficiency and model utility. The security risks stemming from them have not been explored. This study performs the first backdoor attack against the models trained on the data distilled by dataset distillation models in the image domain. Concretely, we inject triggers into the synthetic data during the distillation procedure rather than during the model training stage, where all previous attacks are performed. We propose two types of backdoor attacks, namely NAIVEATTACK and DOORPING. NAIVEATTACK simply adds triggers to the raw data at the initial distillation phase, while DOORPING iteratively updates the triggers during the entire distillation procedure. We conduct extensive evaluations on multiple datasets, architectures, and dataset distillation techniques. Empirical evaluation shows that NAIVEATTACK achieves decent attack success rate (ASR) scores in some cases, while DOORPING reaches higher ASR scores (close to 1.0) in all cases. Furthermore, we conduct a comprehensive ablation study to analyze the factors that may affect the attack performance. Finally, we evaluate multiple defense mechanisms against our backdoor attacks and show that our attacks can practically circumvent these defense mechanisms.

translated by 谷歌翻译

PMT-IQA: Progressive Multi-task Learning for Blind Image Quality Assessment

Qingyi Pan , Ning Guo , Letu Qingge , Jingyi Zhang , Pei Yang

分类：计算机视觉

2023-01-03

Blind image quality assessment (BIQA) remains challenging due to the diversity of distortion and image content variation, which complicate the distortion patterns crossing different scales and aggravate the difficulty of the regression problem for BIQA. However, existing BIQA methods often fail to consider multi-scale distortion patterns and image content, and little research has been done on learning strategies to make the regression model produce better performance. In this paper, we propose a simple yet effective Progressive Multi-Task Image Quality Assessment (PMT-IQA) model, which contains a multi-scale feature extraction module (MS) and a progressive multi-task learning module (PMT), to help the model learn complex distortion patterns and better optimize the regression issue to align with the law of human learning process from easy to hard. To verify the effectiveness of the proposed PMT-IQA model, we conduct experiments on four widely used public datasets, and the experimental results indicate that the performance of PMT-IQA is superior to the comparison approaches, and both MS and PMT modules improve the model's performance.

translated by 谷歌翻译

MGTAB: A Multi-Relational Graph-Based Twitter Account Detection Benchmark

Shuhao Shi , Kai Qiao , Jian Chen , Shuai Yang , Jie Yang , Baojie Song , Linyuan Wang , Bin Yan

分类：计算机视觉

2023-01-03

The development of social media user stance detection and bot detection methods rely heavily on large-scale and high-quality benchmarks. However, in addition to low annotation quality, existing benchmarks generally have incomplete user relationships, suppressing graph-based account detection research. To address these issues, we propose a Multi-Relational Graph-Based Twitter Account Detection Benchmark (MGTAB), the first standardized graph-based benchmark for account detection. To our knowledge, MGTAB was built based on the largest original data in the field, with over 1.55 million users and 130 million tweets. MGTAB contains 10,199 expert-annotated users and 7 types of relationships, ensuring high-quality annotation and diversified relations. In MGTAB, we extracted the 20 user property features with the greatest information gain and user tweet features as the user features. In addition, we performed a thorough evaluation of MGTAB and other public datasets. Our experiments found that graph-based approaches are generally more effective than feature-based approaches and perform better when introducing multiple relations. By analyzing experiment results, we identify effective approaches for account detection and provide potential future research directions in this field. Our benchmark and standardized evaluation procedures are freely available at: https://github.com/GraphDetec/MGTAB.

translated by 谷歌翻译

KoopmanLab: A PyTorch module of Koopman neural operator family for solving partial differential equations

Wei Xiong , Muyuan Ma , Pei Sun , Yang Tian

分类：机器学习

2023-01-03

Given the increasingly intricate forms of partial differential equations (PDEs) in physics and related fields, computationally solving PDEs without analytic solutions inevitably suffers from the trade-off between accuracy and efficiency. Recent advances in neural operators, a kind of mesh-independent neural-network-based PDE solvers, have suggested the dawn of overcoming this challenge. In this emerging direction, Koopman neural operator (KNO) is a representative demonstration and outperforms other state-of-the-art alternatives in terms of accuracy and efficiency. Here we present KoopmanLab, a self-contained and user-friendly PyTorch module of the Koopman neural operator family for solving partial differential equations. Beyond the original version of KNO, we develop multiple new variants of KNO based on different neural network architectures to improve the general applicability of our module. These variants are validated by mesh-independent and long-term prediction experiments implemented on representative PDEs (e.g., the Navier-Stokes equation and the Bateman-Burgers equation) and ERA5 (i.e., one of the largest high-resolution data sets of global-scale climate fields). These demonstrations suggest the potential of KoopmanLab to be considered in diverse applications of partial differential equations.

translated by 谷歌翻译